Towards a comprehensive open repository of Polish language resources

Authors

  • Maciej Ogrodniczuk
  • Piotr Pezik
  • Adam Przepiórkowski
Abstract

The aim of this paper is to present current efforts towards the creation of a comprehensive open repository of Polish language resources and tools (LRTs). The work described here is carried out within the CESAR project, a member of the META-NET consortium. It has already resulted in the creation of the Computational Linguistics in Poland website, which contains an exhaustive collection of Polish LRTs. Current work focuses on the creation of new LRTs and, especially, on the enhancement of existing ones, such as parallel corpora, annotated corpora of written and spoken Polish, and morphological dictionaries, to be made available via the META-SHARE repository. Efforts are made to ensure a high level of reusability of the LRTs by adhering to widely accepted annotation and interoperability standards. Last but not least, since the great majority of the Polish CESAR resources are released under open licenses, special work is required to clarify their Intellectual Property Rights status.


Similar resources

Towards Mapping Thesauri onto plWordNet

plWordNet, the wordnet of Polish, has become a very comprehensive description of the Polish lexical system. This paper presents a plan of its semi-automated integration with thesauri, terminological databases and ontologies, as a further necessary step in its development. This will improve linking of plWordNet into Linked Open Data, and facilitate applications in, e.g., WSD, keyword extraction ...

[Psychiatric rehabilitation in Poland. Polish literature review from 1990-2007].

This article provides a comprehensive overview of contemporary psychiatric rehabilitation and community psychiatry tendencies in Poland. On the basis of articles published in the years 1990-2007 in renowned Polish-language journals (Polish Psychiatry, Advances in Psychiatry and Neurology), an attempt was made to establish the main directions in Polish community psychiatry. Authors review the pres...

Open Resources for Language Technology

NLPFARM is an Open Source code repository for development and sharing of language technology resources. NLPFARM hosts a number of projects covering various language technology needs, providing possibilities to develop more robust and well-formed applications. NLPFARM has been in use for more than a year and our experience is that it has facilitated co-operation and sharing of resources but that...

Attitudes towards English Language Norms in the Expanding Circle: Development and Validation of a new Model and Questionnaire

This paper describes the development and validation of a new model and questionnaire to measure Iranian English as a foreign language learners’ attitudes towards the use of native versus non-native English language norms. Based on a comprehensive review of the related literature and interviews with domain experts, five factors were identified. A draft version of a questionnaire based on those f...

The Spanish DELPH-IN grammar

In this article we present a Spanish grammar implemented in the Linguistic Knowledge Builder system and grounded in the theoretical framework of Head-driven Phrase Structure Grammar. The grammar is being developed in an international multilingual context, the DELPH-IN Initiative, contributing to an open-source repository of software and linguistic resources for various Natural Language Processi...

Publication date: 2012